Detecting User Interests on Twitter via Seed Set Expansion
نویسندگان
چکیده
A central machine learning problem in user interest modeling is to learn what topics users in a social network (such as Twitter) are interested in. Indeed, this is a critical part of understanding users’ behavior. If performed with reasonable precision, it can be exploited in a wide range of applications. Much of the previous work on this problem involves computationally expensive text processing – making the approach language dependent and, thereby, not scalable to international markets. In this work, we propose a novel graph-based approach to interest modeling that is language independent. In particular, we start with a list of people who are influencers and the topics they are Known-For (also called seed set). First, we expand these Known-For topics to other influencers. Later, we learn InterestedIn topics for all users by propagating the known for topics through the social graph. Numerical results show that on the Twitter social network consisting of over 250M users, we are able to grow a seed set of 55K labeled accounts into 88% interest coverage. Additionally, survey results verify that the precision of the detected topics at this coverage is as high as 80%.
منابع مشابه
TUMS: Twitter-Based User Modeling Service
Twitter is today’s most popular micro-blogging service on the Social Web. As people discuss various fresh topics, Twitter messages (tweets) can tell much about the current interests and concerns of a user. In this paper, we introduce TUMS, a Twitter-based User Modeling Service, that infers semantic user profiles from the messages people post on Twitter. It features topic detection and entity ex...
متن کاملDetecting Illicit Drug Usage on Twitter
Social media accounts for a large amount of the data produced on the Internet, informing and connecting many users across the world. Some social media, like Twitter, reveal valuable information about their users and user interests. In this project, we use social media to study opioid abuse, an emerging epidemic resulting in over 20,000 deaths in 2014 alone. We build tools to extract information...
متن کاملTwitviz - Exploring Twitter network for your interests
In this work we introduce Twitviz, a desktop application that allows users to explore their social networks on Twitter based on their interests. Twitviz provides two main tools in the form social network graphs: the Keyword view displays relationships between keywords of interest and users tweeting about them; and the userʼs Network view emphasizing on reciprocal followers. We believe these vis...
متن کاملA Model for Detecting of Persian Rumors based on the Analysis of Contextual Features in the Content of Social Networks
The rumor is a collective attempt to interpret a vague but attractive situation by using the power of words. Therefore, identifying the rumor language can be helpful in identifying it. The previous research has focused more on the contextual information to reply tweets and less on the content features of the original rumor to address the rumor detection problem. Most of the studies have been in...
متن کاملPersonalized Filtering of the Twitter Stream
With the rapid growth in users on social networks, there is a corresponding increase in user-generated content, in turn resulting in information overload. On Twitter, for example, users tend to receive uninterested information due to their non-overlapping interests from the people whom they follow. In this paper we present a Semantic Web approach to filter public tweets matching interests from ...
متن کامل